An Accurate MDS-Based Algorithm for the Visualization of Large Multidimensional Datasets
نویسنده
چکیده
A common task in data mining is the visualization of multivariate objects on scatterplots, allowing human observers to perceive subtle inter-relations in the dataset such as outliers, groupings or other regularities. Leastsquares multidimensional scaling (MDS) is a well known Exploratory Data Analysis family of techniques that produce dissimilarity or distance preserving layouts in a nonlinear way. In this framework, the issue of visualizing large multidimensional datasets through MDS-based methods is addressed. An original scheme providing very accurate layouts of large datasets is introduced. It is a compromise between the computational complexity O(N) and the accuracy of the solution that makes it suitable both for visualization of fairly large datasets and preprocessing in pattern recognition tasks.
منابع مشابه
Improving the efficiency of multidimensional scaling in the analysis of high-dimensional data using singular value decomposition
MOTIVATION Multidimensional scaling (MDS) is a well-known multivariate statistical analysis method used for dimensionality reduction and visualization of similarities and dissimilarities in multidimensional data. The advantage of MDS with respect to singular value decomposition (SVD) based methods such as principal component analysis is its superior fidelity in representing the distance between...
متن کاملMultidimensional Scaling in the Poincaré Disk
Multidimensional scaling (MDS) is a class of projective algorithms traditionally used to produce twoor three-dimensional visualizations of datasets consisting of multidimensional objects or interobject distances. Recently, metric MDS has been applied to the problems of graph embedding for the purpose of approximate encoding of edge or path costs using node coordinates in metric space. Several a...
متن کاملQuestVis and MDSteer : The Visualization of High - Dimensional Environmental Sustainability Data
The visualization of large high-dimensional datasets is an active topic within the research area of information visualization (infovis), a research area that studies the visual representations of complex abstract datasets. My thesis presents two infovis systems that were motivated by the desire to explore a 294-dimensional environmental sustainability dataset. Our collaborators developed the en...
متن کاملA Hybrid Method for Segmentation and Visualization of Teeth in Multi-Slice CT scan Images
Introduction: Various computer assisted medical procedures such as dental implant, orthodontic planning, face, jaw and cosmetic surgeries require automatic quantification and volumetric visualization of teeth. In this regard, segmentation is a major step. Material and Methods: In this paper, inspired by our previous experiences and considering the anatomical knowledge of teeth and jaws, we prop...
متن کاملNew Developments of Nonlinear Projections for the Visualization of Structures in Nonvectorial Data Sets
Aalto University, P.O. Box 11000, FI-00076 Aalto www.aalto.fi Author Teuvo Kohonen Name of the publication New Developments of Nonlinear Projections for the Visualization of Structures in Nonvectorial Data Sets Publisher School of Science Unit Department of Information and Computer Science Series Aalto University publication series SCIENCE + TECHNOLOGY 8/2011 Field of research Computer science ...
متن کامل